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Introduction. 


Graph  coloring  problems  arise  in  many  practical  situations,  for 
exaii5)le  in  various  timetabling  and  scheduling  problems  (see  for  exan?)le 
[15],  [1*+]).  It  would  be  very  useful  to  be  able  to  determine  quickly 
the  chromatic  number  of  a graph.  However,  it  is  well  known  that  this 

problem  is  NP-con^jlete,  and  thus  we  do  not  expect  to  find  good  algorithms 

% 

for  the  problem  ([1],  [10]).  There  has  been  proposed  a class  of  branch- 
and-bound  algorithms,  which  we  call  here  ^kov  algorithms  (see  [5]).  We 
branch  on  whether  or  not  two  non-ad jacent  vertices  have  the  same  color 
and  bound  by  using  the  fact  that  the  chromatic  number  of  a graph  is  at 
least  the  size  of  any  complete  subgraph.  Zirkov  algorithms  always  explore 
at  least  a 'pruned  Zykov  tree'.  We  shall  prove  in  Section  5 below  that 
for  almost  all  graphs  on  n vertices  every  pruned  Zykov  tree  has 

at  least 

^n(log  n)^/^ 

nodes,  for  some  constant  c > 1 . It  follows  that  any  Zykov  algorithm 
requires  in  probability  more  than  exponential  time. 

E.  L.  Lawler  [11]  has  recently  noted  that  a simple  algorithm 
involving  the  meucLmal  stable  sets  of  a graph  requires  only  exponential 
time.  This  algorithm  is  then  faster  than  the  Zykov  algorithms. 

In  the  next  section  we  give  some  preliminsury  definitions,  including 
those  of  Zykov  trees  and  Zykov  algorithms,  and  in  the  following  section 
we  present  seme  preliminary  lenmas.  After  that,  in  Section  1+  we  investigate 
the  size  of  Zykov  trees.  The  standard  algorithm  for  determining  the 
chromatic  polynomial  of  a graph  Involves  the  exploration  of  a Zykov  tree 
(see  for  example  [2]  Chapter  15).  In  Section  5 we  investigate  the  size 

1 


of  pruned  ^kov  trees  and  deduce  that  Zykov  algorithms  are  slow.  We  also 
give  a numericcLl  exangple. 

In  Section  6 we  investigate  a backtrack  coloring  algorithm.  We  show 
that  it  is  essentially  the  same  as  a certain  Zykov  algorithm^  and  obtain 
an  upper  bo\ind  for  the  time  it  requires.  Then  in  Section  7 we  give  an 
interpretation  of  our  earlier  results  in  terms  of  the  lengths  of  certain 
proofs  concerning  the  chromatic  number.  The  results  in  this  section  are 
similar  in  spirit  to  some  recent  resiilts  of  V.  Chvatal  [U]  concerning 
stability  numbers  of  gpraphs;  and  indeed  the  research  reported  in  this 
paper  was  initially  motivated  by  discussions  with  Chvatal  concerning 
his  results.  Finally  in  Section  8 we  consider  •minimal’  coloring 
algorithms,  which  may  use  more  colors  than  necessary,  and  investigate 
the  ratio  of  the  number  of  colors  used  to  the  chromatic  nimober.  This 
last  section  is  not  closely  related  in  content  to  the  rest  of  the  paper, 
but  the  resvilts  there  follow  easily  from  lemmas  used  earlier. 


3.  Preliminary  Definitions. 

A proper  coloring  of  a graph  G (without  loops  or  parallel  edges) 
is  a coloring  of  the  vertices  of  G so  that  no  two  adjacent  vertices 
receive  the  same  color.  The  color  sets  in  such  a coloring  form  a proper 
partition  of  G . The  chromatic  number  x(G)  is  'tJ'®  least  integer  k 
such  that  there  is  a proper  coloring  of  G using  k colors.  A graph  is 
complete  if  every  two  vertices  are  adjacent,  and  the  clique  number  u>(g) 
is  the  greatest  number  of  vertices  in  a con^ilete  subgraph  of  G « 

Let  n be  a positive  integer.  We  let  denote  the  set  of  all 

graphs  with  vertex  set  n)  , Throughout  the  paper  p will  be  a 

constant  with  0 < p < 1 and  q will  be  1-p  , A probability  distribution 

is  induced  on  the  set  of  graphs  by  the  statement  that  each  edge  occurs 

independently  with  probability  p . If  k is  a positive  integer  and 

0 < X < 1 a binomial  random  variable  with  peurameters  k and  x is  the 

sum  of  k independent  {0,1}  -random  variables  Xp...,Xj^  such  that 

Prob{Xj^  ■ 1}  ■ X for  i = 1,  ,.,,k  . Thus  the  number  of  edges  in  a graph 

in  is  a binomisil  random  variable  with  parameters  ( ^ ) and  p . 

* 

We  consider  also  the  set  of  all  graphs  with  vertices  the  sets 

of  a partition  of  (1,  ...,n}  , If  k is  an  integer  we  shall  often  confuse 
k and  {k}  . Thus  for  exan^le  we  may  say  that  . The  use  of 

sets  to  label  vertices  is  sinqily  a notational  convenience. 

We  shall  sometimes  make  statements  involving  such  phrases  as  'for 

almost  all  graphs  in  ’ • For  example  Lemma  5.2  below  states  that 

for  almost  all  graphs  G in  ^ 

n n 

x(<\i)  > 1/2  n / log  n . 

This  simply  means  that 

Prob{G€i(^:  x(Gjj)  ^ V2  n /log  n]  -•  1 as  n -♦  <»  . 
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We  now  move  on  towards  our  definitions  of  Zykov  trees  and  ^kov 
algorithms.  Suppose  that  x and  y are  non-adjacent  vertices  in  a 

* r n 

graph  H in  Jt  . Following  [5]  we  define  the  reduced  graphs  H' 

and  H"  . The  former  H'  is  obtained  from  H by  simply  adding  an 
xy  xy 

edge  Joining  x and  y ; and  the  latter  H"  is  obtained  from  H by 

xy 

replacing  the  vertices  x and  y by  a single  new  vertex  xUy 

adjacent  to  each  vertex  to  which  x or  y was  adjacent.  We  say  that 

H'  and  H"  are  obtained  from  H by  an  'edge-addition'  and  a 
xy  xy 

'vertex-contraction'  respectively.  In  any  proper  coloring  of  H either 
X and  y have  different  colors  or  they  have  the  same  color.  Thus  we 
have  the  well  known  result  (see  [15])  that 

X(H)  = min[x(H;y)  ,x(H;;y)}  . (2.1) 

* 

Suppose  that  we  have  a graph  H in  vdiich  is  itself  a leaf  in 

a binary  tree.  Then  branching  at  H involves  choosing  non-adjacent 

vertices  x and  y in  H and  giving  H the  leftson  H'  and  the 

° ° xy 

rightson  H"  . Of  course  we  cannot  branch  at  H if  H is  complete. 

Now  let  G be  a graph  in  • If  we  start  with  the  single  node  G , 

the  root  of  our  binary  tree,  and  branch  repeatedly  we  obtain  a piurtial 
Zykov  tree  for  G . By  (2.1)  we  know  that  x(^)  minimum  value 

of  x(^)  over  all  leaves  L of  any  partisil  Zykov  tree  for  G . 

A Zykov  tree  for  G is  a partial  Zykov  tree  in  idiich  each  leaf  is  a 
conplete  graph.  We  give  below  an  example  of  a Zykov  tree  for  a graph 
in  . (See  also  [2]  Chapter  15,  [5].) 
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Example. 


1 


k » 


We  have  now  described  the  'branching'  process  to  be  used  in  our 
branch-and-bound  algorithms.  The  'boianding'  process  depends  on  the 
obvious  result  that  for  any  graph  G 

x(G)  > <u(G)  . (2.2 

A Zykov  algorithm  is  a branch-and-bound  algorithm  for  determining 
the  chromatic  number  of  a graphs  using  branch  and  bound  processes  as 
described  above.  Such  an  algorithm  has  a subroutine  for  determining 
for  each  graph  H a lower  bound  u>'  (H)  for  u)(H)  (for  exan^jle  by 
finding  a complete  subgraph  of  H ) . Also  it  maintains  a ciirrent  best 
upper  bound  for  the  chromatic  number,  ^diich  is  always  at  most  the  number 


of  vertices  in  any  graph  encountered.  It  operates  on  a graph  G as 
follows.  It  begins  to  (construct  and)  explore  a partial  Zykov  tree 
for  G i starting  with  the  root  G . Suppose  that  at  some  stage  we 
have  explored  a partial  ^kov  tree  T for  G and  we  have  an  upper 
bound  b for  x(^)  • algorithm  chooses  a leaf  L of  T with 

lu'  (L)  < b if  there  is  such  a leaf,  then  branches  at  L and  updates 
the  upper  bound;  if  there  is  no  such  leaf  L the  algorithm  returns 
x(G)  = b and  stops,  A particular  example  of  a Zykov  algorithm  is 
investigated  in  [5]^  and  another  one  in  Section  6 below. 

It  is  easy  to  see  that  a Zykov  algorithm  always  returns  the  correct 
value  for  the  chromatic  nimber  and  then  stops.  Further  if  say  it  conducts 
a depth-first  search  of  the  partial  Zykov  tree  the  storage  requirement 
need  only  be  say  O(n^)  . The  problem  is  that  ^kov  algorithms  are  very 
slow,  even  if  we  suppose  that  the  subroutine  can  always  determine  u)(H) 
exactly  and  without  cost,  and  that  we  can  always  start  with  the  upper  bound 
at  the  actual  value  of  the  chromatic  number.  (Both  these  suppositions  are 
of  course  rather  \mlikely,  since  we  woxild  be  solving  NP-complete  problems 
[1].) 

Given  a Zykov  tree  Z for  a graph  G the  corresponding  pruned 
Zykov  tree  consists  sin^jly  of  the  root  G if  tu(G)  = x(G)  and  otherwise 
is  the  unique  maximal  rooted  subtree  of  Z containing  as  internal  nodes 
precisely  the  nodes  H of  Z with  uj(H)  < x(^)  • Zykov  algorithm 

must  explore  at  least  some  proved  Zykov  tree  for  G . We  shall  prove 
that  pruned  Zykov  trees  are  usually  very  large  and  thus  that  Zykov 
algorithms  are  ususilly  very  slow. 
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Finally  let  us  establish  some  notation.  We  let  ^ denote  the  set 
of  positive  integers  and  Z the  set  of  non-negative  integers.  For  any 


ireal  number  x we  let  f xT  denote  the  least  integer  not  less  than  x 
, and  Lxj  denote  the  greatest  integer  not  more  than  x . Recall  that 

I i 

I 

I ■ q is  a constant  wi-th  0 < q < 1 (except  that  in  part  of  Section  5 we 

i I 

■ , allow  q to  vary).  All  logarithms  are  to  the  base  l/q  unless  otherwise 

f indicated. 


3 . Preliminary  Results. 

In  this  section  we  present  some  necessary  preliminary  lemmas,  vrtiich 

may  be  of  interest  in  their  own  ri^t.  Lemma  ^.1  is  well  known  and  is 

used  only  in  the  proof  of  Lemma  3*2,  which  is  the  most  used  result  in 

this  section.  The  remaining  results.  Lemmas  3*5  to  3.6  concern  the 

'bounded  sequential  coloring  algorithm',  and  are  needed  here  only  for 

the  'converse'  resxilts  in  Sections  and  5 and  for  Section  8. 

Let  m,n  e N and  let  Q = (S^^,  ...,Sj^)  be  a family  of  pairwise 

disjoint  subsets  of  {1, ...,n}  . We  say  that  Q is  proper  for  a graph 

G in  if  no  two  adjacent  vertices  of  G are  in  the  same  set  S^ 

in  Q . For  each  graph  G in  we  define  a 'contracted'  graph  G„ 

n 

as  follows:  the  graph  G^  has  vertices  S, ,...,S  and  an  edge  between 

X lU 

the  vertices  S^^  and  Sj  if  and  only  if  there  is  an  edge  in  G between 
some  vertex  in  the  set  S.  and  some  vertex  in  the  set  S . . Clearly  G 
may  be  formed  from  G by  a sequence  of  vertex-contractions  if  and  only 
if  Q is  proper  for  G . 

Now  let  m,n  € N and  let  Q be  a partition  of  {1,  ,..,n}  into  m 
sets.  It  seems  reasonable  to  think  that  we  are  likely  to  have  more  edges 
in  G^  the  more  equal  in  size  are  the  sets  in  Q . We  prove  below  that 
this  is  true. 

For  any  random  variable  X we  let  F^  denote  its  distribution 
function,  that  is 

Fjj(t)  = Prob{X  < t} 

for  each  real  number  t . Given  two  random  variables  X and  Y we 
write  X < Y in  distribution  if  > Fy(t)  for  each  real,  number  t . 


Lemma  3«1«  Suppose  that  X , Y , Z are  random  variables,  that  X < Y 


in  distribution,  and  that  toth  the  pairs  X,  Z and  Y,  Z sore  independent. 
Then  X+Z  < Y+Z  in  distribution. 

Proof.  For  any  real  number  t , 

> J*  Fy(t-u)  dF^Cu)  S Fy+2(^)  • □ 

Let  m,n  € M and  suppose  that  m is  fixed.  For  each  real  number  q 
with  0 < q < 1 , let  N(q)  be  a binomial  random  variable  with  parameters 
(“)  and  (1-q)  , and  for  each  partition  Q of  {1,  ...,n}  let  N(n,  Q,  q) 
be  the  number  of  edges  in  the  contracted  graph  for  graphs  G in  ^ 

with  edge-probability  (l-q)  . 


Lemma  3»2.  For  each  partition  Q of  {1, ...,n]  into  m sets  we  have 


N(n,  Q,q)  < ) in  distribution. 


(3.1) 


Proof.  We  may  of  course  assume  that  ro  > 2 . We  shall  prove  first 
that  for  each  partition  Q of  {1,  ...,n}  into  m sets  we  have 


N(n,q,q)  < N(q 


r n/ml ' 


) 


in  distribution. 


(5.2) 


Let  Q=  (S^# . . .» Sjjj)  be  a partition  of  {1,  ...,n}  into  m sets; 
let  s^  = js^l  for  i = 1,  ...,m  ; and  suppose  that  s^+1  < s^-l  . Let 
V e Sg  and  let  Q'  be  the  partition  obtained  from  Q by  switching  v 
from  Sg  to  . In  this  psu*t  of  the  proof  of  the  lenma  both  n and 
q will  be  fixed.  Denote  N(n,Q,q)  and  N(n,Q',q)  by  N.  and  N , 
respectively.  In  order  to  prove  (3.2)  it  is  sufficient  to  prove  that 

^ distribution.  (3.3) 
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Prob^X^+Xg  > 2)  = ProbfXj^  = l)Prob{5^  «=  1) 


S.S  S-S 

1 mw,  2 

(1-q  )(l-q  ) 


S.S 

. 1 m 

1-q  - q 


S.S  (s,+s„)s 

2m.  ' 1 2'  m 

+ q 


and  similarly 


Erob{X[  + X^  > 2}  >=  Prob{X£  = l}Prob{X^  = 1} 

(s.+l)s  (s  -l)s 

'1  m-./,  2 ^ mv 

= (1-q  )(i-q  ) 

(s^tl)s 

= 1-q  - q + q 


m 


m 


Now  let  t = q , so  that  0 < "t  < 1 . Then 
Prob{X^  + X^  >2}  - Prob{X^  + X2  > 2} 


s s (s  +1)  (s  -1) 
= t +t^^-t  -t 


^2""^ 

= (l-t)(t^-t^  ) , 


and  this  last  expression  is  non-negative,  since  s^  < s^-l  . But  this 
completes  the  proof  of  (3.5)  and  so  of  (5.2).  We  now  use  (5.2)  to 
prove  (3.1). 

Given  a set  S of  positive  integers  and  a positive  integer  k let 
kS  be  the  set  of  positive  integers  i such  that  T i/kl  is  in  S . 

Given  a partition  Q = (S^,  Sg, . . .,  Sj^)  of  {1,2,  ...,n}  for  some  integer  n 
let  kQ  be  the  partition  (kSj^,kSg, . . .,kSjj^)  of  {1,2,  ...,kn}  . For 
exangple  if  Q is  the  partition  ({1>2},  {5})  of  {1,2,5}  then  2Q  is 
the  partition  ({1^2, 3, U], {5,6})  of  [1,2, ...,6}  . 
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Let  ncW  » let  Q be  a partition  of  {1,2,  ...,n}  into  m sets 

and  let  q be  a real  number  with  0 < q < 1 . Let  Jr  i denote  the 

•h,l-q 


set  with  edge-probabilities  1-q  . Then  N(n,  Q,q)  is  the  sum  of 
( 2 ) independent  {0,1}  random  variables  (l  < i < j < m)  such 

that 


Prob{X^jsl}  = Prob{Ge^  some  vertex  in  is  axijacent  to 

JSilISjl 


some 


vertex  in  Sj  } 


1-q 


1/k^ 

Let  k be  a positive  integer.  Then  N(kn  , kQ , q ' ) is  the  sum  of  ( 

independent  {0,1}  random  variables  (1  < i < j < m)  such  that 

“ Prob{Gei(  some  vertex  in  kS^  is  adjacent 

kn,l-q^/^ 

to  some  vertex  in  kS.} 

D 


I 

= 1 - q 


|Sil|S,l 


Hence  for  each  positive  integer  k , 

N(n, Q,q)  = N(kn,kQ,q^/^  ) in  distribution.  (3.6) 

By  (3.2)  and  (j.6)  for  each  ke  N we  have  that  in  distribution 


N(n,Q,q)  = N(kn  , kQ , q^/^^) 

-F-f 

,2  I m I 


< N\  q- 


2 2 

"^r^l  "*(^m)  k-.oo,  and  so  clearly  (3.1)  holds.  This 


conqjletes  the  proof  of  Lemma  3.2.  □ 
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We  define  an  algorithm  related  to  the  sequential  algorithm  (SA)  for 
coloring  graphs  (see  [8],  [9],  [I5])  and  vtoich  we  call  the  bounded 
sequential  algorithm  (BSA).  We  shall  look  at  graphs  G in  for 
some  n in  M . Suppose  that  we  have  a positive  integer  s . The 
BSA  (bounded  at  s ) acts  on  eeudi  graph  G in  the  same  way  as  the  SA, 
except  that  we  allow  each  color  set  to  contedn  at  most  s elements. 

Thus  the  BSA  (bounded  at  s ) colors  vertex  1 with  color  1 and  then 
colors  the  remaining  vertices  in  increasing  order,  coloring  vertex  i 
with  color  j if  j is  the  least  positive  integer  such  that  vertex  i 
is  not  Eidjacent  to  any  vertex  alreedy  colored  j and  such  that  there 
are  at  most  (s-1)  vertices  already  colored  j . 

Suppose  now  that  we  have  also  a positive  integer  t , For  each 
graph  G in  if  we  shall  be  interested  in  the  family  Q(G)  (=  Q +(g)) 
consisting  of  the  first  t color  sets  constructed  by  the  BSA  (bounded 
at  s );  and  more  interested  in  the  contracted  graph  G*  = • We 

say  that  a family  Q(G)  as  above  is  full  if  each  of  the  t sets  cont€iins 


the  full  s elements. 


Lemma  3«3«  Let  N be  a binomial  random  variable  with  parameters  ( ^ ) 
2 2 
g 

and  q . Then  for  each  non-negative  integer  k 

Prob{G^  misses  at  most  k edges}  > Prob{N  < k}Prob{Q(G^)  is  full]  . 


Proof.  Let  R (=  ft(n, s,t))  be  the  collection  of  all  the  families 
Q(G)  for  graphs  G in  ir^  . Thus  R is  the  collection  of  all  families 
(Sq^,  ...,S^)  of  t disjoint  subsets  of  {!,..., n}  each  of  size  at  most  s 
and  such  that  for  eeu:h  index  i in  {1, . . . , t } and  each  vertex  v in  a 
set  with  index  greater  than  i , if  |Sj^|  < s or  v < u for  seme  vertex 
u in  then  v > u*  for  some  vertex  u'  in  . 


J 
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Let  Q a=  (S^,  be  a family  in  R . Let  X be  the  set  of 

graphs  G in  such  that  no  two  vertices  adjacent  in  G lie  in  the 

same  set  , and  let  Y be  the  set  of  graphs  G in  such  that 

for  each  index  i in  {1,  ...,t}  and  each  vertex  v in  a set  with  index 
greater  than  i , if  |s^l  < s or  v < u for  some  vertex  u in 
then  v is  adjacent  in  G to  some  vertex  u'  in  with  v > u'  . 

Then 

Q(G)  = Q}  = XQY  . 

Now  clearly  in  distribution  we  have 

1e(Gq)1  < 1e(Gq)1  given  GeY  , 

and  conditioning  on  X does  not  affect  the  distribution  of  the  niomber 
of  edges  in  G_  . Thus  in  distribution 

1e(Gq)1  < 1e(Gq)1  given  Q(G)  = Q . (3.7) 

But  now  for  each  X e Z ^ 

Prob{G'  misses  at  most  k edges} 

= Prob{lE(G')l  > (g)  -k] 

= E Prob{ls(G  )|  > ( p ) - k I Q(G)  = Q}Prob{Q(G)  = Q] 

QeR  ^ 

> E Rrob{lE(G  )|  > (p) -k}Prob{Q(G)  = Q]  (by  (3.7)) 

^ E Prob{(  I ) - 1e(Gq)|  < k}Erob{Q(G)  = Q} 

Q full 

= Prob{N  < k}prob{Q(G)  full}  . □ 


Ik 


Lemma  3»U.  For  any  positive  integers  n , s , t with  st  < n 
Prob{Q(G^)  not  full}  < n(l  - . 

Proof.  For  each  graph  G in  and  for  i = 1,  ...,t  let  S^(g) 

denote  the  i-th  set  in  Q(g)  . Then 

t 

{Q(Gj^)  not  full}  = ^U^{|S^(G^)|  < s}  . 

Now  for  each  k < n in  W and  each  graph  G in  Jf  let  a.  (G  ) 

— n n K n 

denote  the  number  of  vertices  of  G^  amongst  the  first  k which  the  SA 
colors  with  the  first  color  (see  [8]).  Then 


Prob{Q(Gjj)  not  full}  < T Prob{  jSj^(G^)  j < s} 

i • 1 


< S (eee  [8]) 

i=l 


Lemma  3.5.  Let  e > 0 and  let  s and  t be  functions  from  M 
to  ^ such  that  s(n)  < (l-e)log  n and  s(n)t(n)  < (l-e)n  for  each 
n in  N . Then 

Prob[Q((^)  full}  - 1 as  n - « . ( 

If  further  s(n)  > (2  log  n)^/^  for  each  n in  W then 

Prob{(^  coM5)lete}  -*  1 as  n -•  ® , ( 


Proof.  By  Lemma  J.li 


log  Prob{Q(G^)  not  full]  < log  n - en/log  n • log  e 


-®  as  n -«  00 


and  so  (5.8)  holds.  Now  suppose  that  s(n)  > (2  log  n)^/^  for  each  n 
in  K1  . If  N is  as  defined  in  Lemma  3.3  then 

0 > log  Prob{N  = 0} 

= ( o ) iog(i  - q®  ) 


2 - V”^) 


-»  0 as  n -•  00  , 


Hence 


Prob{N  =0}-«l  as  n-»oo  . 

Now  (3.9)  follows  from  (3.8),  (3. 10)  and  Lemna  3.3.  □ 


(5.10) 


LCTma_^^.  Let  e > 0 . Then  for  almost  nil  graphs  G in  > there 

n 

is  a proper  partition  R of  G into  at  least  (l-e)n(2  log  n)"^/^ 
sets  such  that  the  contracted  graph  Gj^  is  con5)lete. 

^oof.  Let  s(n)  «*  f (2  log  n)^/^1  and  t(n)  « r(l-e)n(2  log  n)“^/^l 
for  each  n in  /^  . Then  by  Lenina  3.5 

complete)  -♦  1 as  n -*  oo  , (3. 11) 

Now  with  each  graph  G in  we  shall  associate  a proper  partition 
R(G)  related  to  the  proper  family  Q(G)  , and  the  contracted  graph 
* °R(G)  the  contracted  graph  G’  - G^^^j  . Consider  a 

graph  G in  . Suppose  that  the  vertices  of  G not  in  any  set  in 
Q(G)  are  in  increasing  order.  For  i ■ 1,  ...,J  in  turn 
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add  the  vertex  to  the  first  possible  set  in  Q(g)  (that  is,  to  the 

first  set  in  Q(G)  such  that  is  not  adjacent  to  any  vertex  in  the 

set)  and  if  we  cannot  suid  v^  to  any  already  present  set  in  Q(g)  then 

we  add  to  q(g)  a new  singleton  set  {v^}  . In  this  way  we  construct  a 

proper  partition  R(G)  of  G with  at  least  t sets.  Let  G*  be  the 

contracted  graph  • Then  clearly  the  number  of  edges  missing  in 

* 

G is  at  most  the  number  of  edges  missing  in  G'  • Hence  in  particular 
we  have  by  (j.ll)  that 

Prob{G*  complete}  - 1 as  n ® . □ 

Lemmas  3»5  and  are  in  convenient  forms  for  the  present  purposes: 
they  cleeurly  are  not  in  their  strongest  forms. 


U.  Zykov  Trees. 

In  this  section  we  investigate  the  sizes  of  ^kov  trees.  We  have 
three  main  reasons  for  doing  this.  Firstly  the  sizes  of  Zykov  trees  are 
of  interest  in  their  own  ri^t,  for  exanple  if  we  wish  to  determine  the 
chromatic  polynomial  of  a graph  ([2],  Chapter  15);  secondly  some  knowledge 
of  the  sizes  of  Zykov  trees  helps  us  to  interpret  resvilts  on  the  sizes  of 
pruned  Zykov  trees;  and  thirdly  seme  of  the  arguments  which  we  use  here 
are  similar  to  those  we  use  for  proved  Zykov  trees  in  the  next  section. 

There  are  two  theorems  in  this  section.  The  first  shows  in  particular 
that  every  Zykov  tree  for  a given  graph  has  the  same  size,  that  is  the 
same  number  of  nodes.  Given  a graph  G let  us  denote  by  C(g)  the 
number  of  proper  partitions  of  G (that  is,  the  number  of  colorings 
of  G with  'color  indifference'). 

Theorem  U.l.  Every  Zykov  tree  T for  a graph  G has  2C(G)-1  nodes. 

Proof*  It  is  not  hard  to  check  that  the  vertex  sets  of  the  leaves  of 
T are  in  1-1  correspondence  with  the  proper  partitions  of  G . □ 

The  next  theorem  gives  asymptotic  resiolts  which  by  Theorem  1^.1  above 
may  be  stated  in  terms  either  of  the  size  of  ^kov  trees  for  a graph  G 
or  of  the  number  C(G)  of  proper  partitions  of  G . We  choose  to  state 
them  in  terms  of  the  latter.  It  is  convenient  to  separate  out  part  of 
the  proof  as  a lemna. 

For  every  n in  ^ and  | , r in  2 let  T^(/,r)  be  the  set 
of  graphs  G in  such  that  in  every  Zykov  tree  for  G if  we  steurt 

at  the  root  G we  can  always  make  l(n)  left  turns  and  r(n)  ri^t 
turns  without  reaching  a leaf,  if  a graph  G is  in  T^(/,r) 


then  certainly  every  Zykov  tree  for  G has  at  least  ( ^ ) nodes.  We 

wish  to  choose  the  functions  i and  r so  that  Prob  T (t.r)  ->  1 

n ' ' 

as  n - CD  and  ( ) is  as  large  as  possible. 

Lemna  k,2.  There  exist  functions  I and  r from  W to  such 

that 


Prob  Tj^(i>r)  -*1  «is  n-»® 

(1^.1) 

and 

log(  ) > n(log  n - 3(  i log  n)^/^) 

for  n sufficiently  large 

(1^.2)  i 

For  exanq)le  we  ms^  take  i and  r so  that 

1 

l(n)  = n2-(2"^/5  + o(l))(log  n)’^/5 

(M) 

and 

r(n)  = Ln(l  - ( 1 log  J • 

(4.1^) 

Proof.  Let  t and  r be  functions  from  N to  H > such  that 
l(n)  < ( 2 ) > which  we  shall  choose  below.  For  each  n 

in  W let  m(n)  = n-r(n)  , let  x(n)  = n/m  and  let  k(n)  = ( ™ ) • 

We  shall  choose  r so  that  x(n)  -»  ® as  n -•  ® but  x(n)  = o((log  n)^/^)  . 

Let  denote  the  set  of  partitions  of  fl,2, ...,n]  into  at  least  m 

non-empty  sets.  Then  the  conplement  T^(^>r)  of  T^(i,r)  in  satisfies 

f (i,r)  = U {Gejr  ; Q proper  for  G and  G misses  at  most  I edges} 

c u {GeJ<  ; G^  misses  at  most  I edges}  . (*+»5) 

Let  N be  a binomial  random  variable  with  parameters  k and  q . Then 

by  Lemma  3*2  for  each  partition  Q in  we  have 


19 


Prob{GeJ>^:  misses  at  most  t edges}  < Prob{N  < f}  . (4.6) 

Now  clearly  contains  at  most  n*^  partitions  and  so  by  (4.5)  and  (4.6) 

Prob  T^(f,r)  < n’^  Prob{N  < /}  . (4.7) 

We  shall  use  (4.7 ) to  ensure  that  Prob  T (f^r)  -•  0 as  n -•  ® , and  so 
clearly  we  must  take  I < E[N]  (at  least  for  large  n ).  We  let 

2 

i(n)  ~ I E[N]  = |kq*  ^ . (4.8) 

Now 

Prob{N<i]=:  E (h(q'"^)^(l-q''^)^'^ 

i =0  ^ 

< (^+1)(  (4.9) 

(for  n sufficiently  large  that  |(n)  < E[N]  ). 

Now  by  (4.7),  (4.8)  and  (4.9) 

log  Prob  f^(f,r) 

2 

< n log  n + i log  k - f log  I + I log  e - x I - (k-l)  log  e q + 0(log  n) 

= n log  n + i(log  k - (-  log  2 + log  k - x ) + log  e - x - 2 log  e + o(l)) 

= n log  n - f(log  e - log  2 + o(l)) 

- ® as  n -*  ® 

Hence  (4.1)  holds.  It  remains  to  choose  r . Now 
l^a(  j,  ) > (n-m)(log  I - log  n) 

= n(l-x"^)(log  n - 2 log  x - x^  + 0(1)) 

> n(log  n - x’^  log  n - x^  + x(l  + o(l)))  . (4.10) 
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(2)  We  first  show  that 

log  E[C^]  > n(log  n - (2  log  i log  log  n + 0(l))  . (^•H) 

Let  d be  a function  from  to  ^ such  that  d(n)  -♦  » as  n -•  « 

but  say  d(n)  = O(n/log  n)  . We  shall  choose  d below.  Let  be  the 

set  of  partitions  of  {l,  ...,n]  into  k = Ln/dj  sets  each  of  size  d 
and  (possibly)  the  (n-kd)  singleton  set  {kd+1 fn]  . Then  the 
n\amber  of  partitions  in  equals 


and  the  probability  that  a partition  in  R^  is  proper  equals 

(2)^  |nd 
q > q 

Hence  the  logarithm  of  the  expected  number  of  proper  partitions  in  R^  is 
at  least 

(n-d)  log(n-d)  - (n/d)  log(n/d)  - (n/d)(d  log  d)  - i nd  + 0(n) 

= n(log  n - log  n/d  - log  d - ^ d + 0(l))  . (*<^.12) 

Now  let 

f^(x)  = log  n/x  + log  X + i X 


for  X > 0 . Then  fjj(x)  achieves  a \mique  minimnn  for  x > 0 at 
X = (2  log  n + 1)^/^  - 1 and  this  minimum  equals 


We  set  d(n)  = |_(2  log  for  ne  M and  find  that  the  right  hand 

side  in  (4.12)  equals 

n(log  n - (2  log  n)^/^  - i log  log  n + 0(l))  . 

Hence  certainly  (4.11)  holds. 

We  now  show  that 

log  E[Cj^]  < n(log  n - (2  log  n)^/^  - | log  log  n + 0(l))  . (4.14) 

The  inequalities  (4.11)  and  (4.14)  of  course  prove  the  second  part  of 
the  theorem. 

Let  k = k(n)  be  an  integer  i such  that  the  expected  number  of 
proper  partitions  into  i non-en5>ty  sets  is  a maximum.  Then  clearly 
E[Cj^]  is  at  most  n times  the  expected  number  of  proper  partitions 
into  k non-en5)ty  sets.  Let  d = d(n)  = n/k  . (Thus  d(n)  is  not 
necessarily  an  integer. ) 

Let  Q=  (S^,  ...,S^)  be  a partition  of  n]  and  let 

Sf  = jS^I  for  i = 1,  ...,k  . Then  as  in  [8  ] we  see  that  the  probability 

i (n^/k  - n) 

q 

into  k non-en5)ty  sets  is  at 


and  so 


T.naT.  is  proper  equals 


k |s.(s.-l)  |(7s2-n) 

TT  q = q 

i = l 


Also  the  number  of  partitions  of  {1, . 
most  k^/kl  . Hence 

, n I (n^/k  -n) 

< n 


..,n} 
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log  E[c^j  < n log  k - k log  k - g ^ + 0(n) 

n 1 

= n log  n - n log  d - ^ log  n - i nd  + o(n) 

= n(log  n - f^(d)  + 0(1)) 

But  by  (k.15) 

f^(d)  > (9  log  n)^/^  + i log  log  n + 0(1) 
and  so  we  have  proved  (k.l4). 

(5)  The  left  hand  inequality  in  part  (5)  follows  immediately  from 
Lemma  4.2  and  the  discussion  preceding  it.  Now  clearly 

log  E[C^]  > n(log  n - (2  log  n)^/^)  + log  Prob{log  C(Gj^)  > n(log  n - (2  log  n)^/^)} 
and  so  by  peirt  (2) 

log  Prob{log  C(G^)  > n(log  n - (2  log  n)^/^)} 

< n(-  i log  log  n + 0(1)) 

-»-oo  asn-*co. 

This  proves  the  rigjht  hand  inequality  in  part  (5)^  and  thus  con^jletes  the 
proof  of  the  theorem.  □ 

There  is  a fedrly  large  difference  between  the  left  and  ri^t  hand 
sides  in  the  third  part  of  Theorem  4.5  above.  The  second  part  sxiggests  that 
the  ri^t  hand  inequality  in  the  third  part  may  be  quite  good.  It  thus 
seems  quite  possible  that  the  left  hand  inequality  is  rather  weak.  Recall 
that  the  left  hand  inequality  follows  from  Lemma  4,2,  Proposition  4.4  below 
shows  that  Lenina  4,2  is  in  a sense  best  possible.  Proposition  4,4 
corresponds  to  Proposition  5.7  in  the  next  section.  We  do  not  prove 
Proposition  4,4  here;  it  may  be  proved  along  the  lines  of  the  proof  of 
Proposition  5.7>  using  the  results  in  Section  3. 
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Proposition  4.4.  Let  i and  r be  functions  from  to  ^ such 
that 

> n{log  n - (5 + o(l))(  I log  n)^/^} 

Then 

Prob  Tj^(i^r)  -•  0 as  n . 

Note  that  above  means  that  for  any  function  f such  that 

f(n)  c n{log  n - (5 + o(l))(  i log  n)^/^] 

we  have 

I* 

log(  j.  ) > for  n sufficiently  large. 
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. Pruned  ^kov  Trees. 

In  this  section  we  investigate  the  size  of  pruned  ^kov  trees.  We 
do  not  manage  to  find  out  as  much  about  pruned  2ykov  trees  as  we  found 
out  about  (lanpruned)  Zykov  trees  in  the  last  section,  but  we  are  able  to 
prove  a greater  than  exponential  lower  bound.  This  result  shows  that 
Zykov  algorithms  for  determining  the  chromatic  number  of  a graph  us\ially 
require  more  than  exponential  time. 

We  have  seen  that  every  Zykov  tree  for  a given  graph  has  the  same 
size.  Thus  certainly  if  we  have  to  construct  a Zykov  tree  there  is  no 
point  in  spending  time  choosing  a 'best'  way  of  branching.  The  situation 
is  quite  different  when  we  look  at  pruned  Zykov  trees.  Two  pruned 
Zykov  trees  for  a given  graph  may  have  different  sizes. 

Example,  Two  pruned  Zvkov  trees  for  ■ 

/ \ / \ 

^ 

/ \ 
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For  every  graph  G let  r(G)  be  the  ratio  of  the  greatest  size 
to  the  smallest  size  for  pruned  Zykov  trees  for  G ; and  for  each  n 
in  let  r(n)  be  the  maximum  value  of  r(G)  over  all  graphs  G 

on  n vertices.  Thus  r(n)  is  a measure  of  the  possible  variation 
in  sizes  of  pnmed  Zykov  trees  for  graphs  on  n vertices. 

For  each  graph  G on  at  most  four  vertices  we  have  x(Cf)  = iu(G) 
and  so  every  pruned  Zykov  tree  for  G has  exactly  one  node.  Thus 
r(l)  = r(2)  = r(5)  = r(k)  = 1 . 

The  example  above  shows  that  r(5)  > 1 , and  by  adding  isolated  vertices 
to  a graph  it  is  easy  to  see  that  r(n)  (strictly)  increases  from  n = 5 
onwards.  Thus 


r(n)  > 1 for  n > 4 . 

In  fact  r(n)  grows  dramatically  with  n . 


Proposition  3.1.  r(n)  > 


§ (1+0(1)) 


We  prove  Proposition  5.1  by  constructing  for  each  integer  n > 7 
a graph  Hj!^  on  n vertices  such  that 

r(H;)  > 2C{0  ).i  , n'" 

[lJ-1 


(5.1) 


Here  C(0j^)  is  the  number  of  partitions  of  a set  of  k distinct  elements 
(see  Theorem  4.5). 

First  for  each  integer  k > 5 let  Hj^  be  the  pentagon  plus  (k-5 ) 

vertices  adjacent  to  each  other  vertex.  Thus  Hj^  is  a 'wheel  with  (k-5) 

axles';  see  the  example  below  for  . It  is  easy  to  check  that  <«(Hj^)  = k-3 

and  = k-2  ; and  that  every  pruned  Zykov  tree  for  has  exactly  three 

nodes.  Now  for  each  integer  n > 7 let  H'  be  the  graph  H_ 

1 1 

together  with  [_n/2j-l  isolated  vertices.  ^ 


By  branching  within  the  large  con5)onent  of  we  see  that  the 
smallest  size  of  a pioined  Zykov  tree  for  is  3 • Now 

Ln/2J-1  < rn/21-1  = x(h;^)  • 

Hence  by  branching  first  amongst  the  |_n/2j-l  isolated  vertices  in 
we  see  that  the  greatest  size  of  a prmed  Zykov  tree  for  is  at  least 

the  size  of  an  unpnmed  Zykov  tree  for  the  graph  ^|^n/2j-l  consisting 
of  Ln/2j-l  isolated  vertices.  But  by  Theorem  If. 3 every  Zykov  tree 
for  this  graph  has  n/2j -l^"^  nodes.  We  have  now  proved  (5.1) 

and  so  conqpleted  the  proof  of  Proposition  5.1.  □ 


Note  that  if  the  isolated  vertices  are  listed  first  then  the  marked 
Zykov  algorithm  will  explore  at  least  the  large  pruned  Zykov  tree  for 
and  so  the  backtrack  coloring  algorithm  will  also  do  badly  (see  Section  6) 
We  now  move  on  towards  our  main  res\ilts.  We  need  first  a lemma 
concerning  the  chromatic  number  of  a random  graph,  which  is  taken 
essentially  from  [8].  Recall  that  8dl  logarithms  are  to  the  base  l/q 
unless  otherwise  indicated.  A set  of  vertices  in  a graph  G is  stable 
if  no  two  sure  suijacent,  and  the  stability  number  Q:(g)  is  the  greatest 
number  of  vertices  in  a stable  set. 


' u'.s.vjm.  ».:ir . 
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Lemma  5.2.  For  almost  all  graphs  G in 

o -r  JJ 

x(Gn)  > I n/log  n . 

Proof.  If  n then  certainly  the  stability  mmiber 

a(G  ) of  G satisfies 
n " 


n 


a(Gj^)  > n/x(G^)  > 2 log  n . 
But  if  we  set  s(n)  = [2  log  nl  then 


( ^ ) 
n X 2 ' 


0 as  n 


Probfa(Gjj)  > si  < ( ^ )q 
Hence  Probfx(G^)  < ^ n/log  nl  -♦  0 as  n -♦  ® . □ 

The  following  conjecture  appears  essentially  in  [8], 
Conjectxure  $.3.  If  e > 0 then  for  almost  all  graphs  G^  in 
x(Gn)  < ( I + e)n/log  n . 


We  need  one  more  lemma  in  order  to  prove  our  main  results.  Suppose 
that  we  have  a positive  constant  a and  functions  £ and  r from  W 
to  2 . For  each  n in  Psi  let  T^(i>r)  be  the  set  of  graphs  G 
in  such  that  in  every  Zykov  tree  for  G whenever  we  start  at  the 

root  G and  make  f(n)  left  turns  and  r(n)  right  turns  we  do  not 
encounter  any  node  H with  u)(H)  > a x(G)  • (Cong^are  with  the  definition 
of  T^(f^r)  preceding  Lemma  k,2  in  Section  k.)  If  G is  a graph  in 
T^(/>r)  then  certainly  every  Zykov  tree  for  G has  at  least  ( ) 

nodes  H with  w(H)  < a x(G)  • Thus  setting  a » 1 we  see  that  if  G 
is  in  T^(i,r)  then  every  pruned  Zykov  tree  for  G has  at  least  ( ) 

nodes.  We  wish  to  choose  the  functions  t and  r so  that 
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Prob  T^(i,r)  -•  1 as  n -•  ® and  ( ) is  as  large  as  possible. 

Lemma  5.U.  Let  a be  a positive  constant.  Then  there  exist  functions 
t and  r from  ^ to  z such  that 

Prob  l^(i,r)  -•  1 as  n -•  <»  (5.2) 

and 

log(  ~ 0(  n(^  log  n)^/^  . (5.3) 

For  example  we  may  take 

f(n)  = i_n^/^(log  n)"5j  (5.4) 

and 

r(n)  = La(l2  log  n)'^/^J  . (5.5) 

Lenma  5«^  above  of  course  corresponds  to  Lemma  4.2  for  (unpruned) 

Zykov  trees,  and  we  saw  in  Section  4 that  Lemma  k.2  is  in  a sense  best 
possible.  At  the  end  of  this  section  we  shall  prove  that  Lemma  5.^  is 
also  in  a sense  best  possible. 

Proof.  Let  t and  r be  ftmctions  from  to  M , which  we  shall 

QL  "1 

choose  later.  Let  b(n)  ^ — n(log  n)  J , let  be  the  set  of 

graphs  G in  such  that  j^(g)  > b(n)  , and  let  B^(f,  r)  be  the  set 

of  gragphs  G in  such  that  in  every  ^kov  tree  for  G whenever  we 

start  at  the  root  and  make  /(n)  left  turns  euid  r(n)  right  turns  we 
do  not  encounter  any  node  H with  u»(h)  > a b(n)  . Then 

^ T“(l,r)  . (5.6) 

By  Lenma  5.2  Prob(B^)  -•  1 as  n -»  ® . Hence  if  Prob  B^(i,r)  -»  1 as 

a, 

n -»  ® then  so  does  Prob  Tjj(f,r)  . Thus  we  wish  to  choose  t and  r so 
that  Prob  -.1  as  n ® and  ( p ) is  as  large  as  possible. 


^ ,-L  ^ 


sr: 
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We  now  look  at  the  ccmqplement  B^(f,r)  of  B^(/,r)  in  ^ . 

Let  ft  be  the  collection  of  all  families  Q=  (S^,  ...,3^^)  of  b disjoint 

subsets  of  {1, ...,n}  with  iinion  containing  r+b  elements.  For  each 

family  Q in  ft  let  T_  be  the  set  of  graphs  G in  ^ such  that  the 

H n 

contracted  graph  misses  at  most  / edges.  Now  if  G is  a graph 
in  B^(/,r)  then  some  graph  obtained  from  G by  performing  at  most  r 

vertex-contractions  contains  a subgraph  on  b vertices  missing  at  most 

i edges;  and  so  G € for  family  Q (proper  for  G ) in  ft  . Hence 


\(i>r)  c U {T^:  Q^ft} 


(5.7) 


Next  we  find  an  upper  bovind  for  Prob(Tjj)  . It  is  ccxivenient  to  let 
”1  = ( 2 ) • We  shall  choose  r so  that  x(n)  -«  ® as 

n oo  . Let  N be  a binomial  random  variable  with  parameters  m and 
By  Lemma  5.2  for  each  Q in  ft  , 

Prob(T^)  < Prob{N  < /}  . (5.8) 

Now  cleeu-ly  ft  contains  at  most  n*^  families  Q . Hence  by  (5.7) 


and  (5.8) 

Prob  < n“  Prob{N  < i] 

We  shall  use  (5.9)  to  ensure  that  Prob  Bjj(i>r)-.0  as  n-.*,  and 
so  of  coiirse  we  need  i(n)  < E[N]  (at  least  for  large  n ). 


(5.9) 


We  set 


tin)  = [ I E[N]  J = [ I mq*  J 


Prob{N<i)=  E (?)(q''  )^(l-q'"^)®'^ 
i = 0 ^ 

< (f+l)(®)q*  ^1  -q"^  )“"^ 


(5.11) 


31 


f 


Note  that  the  right  hand  side  above  depends  only  on  x (and  n ),  We 
have 

log  Prob{N  < t] 

2 

< I log  m - i log  / + / log  e - X / - (m-i)  log  e q + O(log  n) 

1 2 2 

= /(log  m - (log  ^ + log  m - X ) + log  e - x - 2 log  e + o(l)) 

= /(log  2 - log  e + o(l))  . (5.12) 

Now  suppose  that  r(n)  = |_\n(log  for  some  constant  \ with 

0 < X.  < ^ a say.  Then  x(n)  (2x/a)(log  n)^/^  and 

log  /(n)  = (2  - k\^/o^  + o(l))  log  n . (5.15) 

But  now  by  (5.9)>  (5.12)  and  (5.15) 

Prob  Bjj(/,r)  -»  0 as  n . 

We  next  look  at  the  value  of  ( ^ ) and  choose  a value  for  \ . 

Now 

log(  ^^^  ) = r{log  / - log  r + 0(1)1 

= \n(log  n)"^/^(2  log  n - (4\^/a^) (log  n)  - log  n + 0(log  log  n)l 
= (X  - + o(l))  n(log  n)^/^ 

The  maximum  value  of  X - for  X > 0 is  attained  at  \ s 12~^/^a  < 

Thus  we  give  \ this  value,  and  find  that 

log(  ^^^  ) = (5'^^^  + o(l))  an(log  n)^/^  , (5.lU) 

as  required.  The  value  we  have  chosen  for  r is  as  in  (5.5).  Clearly 
we  may  decrease  the  value  of  / from  that  in  (5.10)  if  we  do  not  thus 
falsify  (5.lU).  Thus  we  may  set  / as  in  (5.^).  This  conq)letes  the 
proof  of  Lenma  5.^*^.  U 
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From  Lemma  ^.k  and  the  discussion  preceding  it  we  may  now  deduce 
immediately  o\ir  main  results. 

Theorem  If  ot  is  a positive  constant  then  for  almost  all  graphs 

in  , every  Zykov  tree  for  is  such  that  the  logarithm  of  the  number 

of  nodes  H with  u)(H)  < a x(^)  asyngjtotically  at  least 

an(^logn)^/^  . 

The  most  interesting  special  case  of  Theorem  5.5  above  is  i^en 
p = q = 1/2  and  a = 1 . 

Corollary  5.6.  Coasider  the  property  for  graphs  G on  n vertices 

1 ^ 

that  every  pruned  Zykov  tree  for  has  size  at  least 

n(log^  n)^/^ 

(l.ll^)  ^ 

The  proportion  of  graphs  on  n vertices  with  this  property  tends  to  1 
as  n -*  00  , 

Corollary  5*6  shows  that  any  Zykov  algorithm  as  defined  in  Section  2 
'almost  always*  requires  more  than  exponential  time.  Thus  certainly 
there  exists  a sequence  (G^, Gg^ . . .» . . . ) such  that  G^  is  a graph 
on  n vertices  and  the  time  taken  by  any  Zykov  algorithm  on  G^  grows 
faster  than  e^onentially  with  n . No  construction  is  known  for  such 
a sequence. 

M.  R.  Garey  and  D.  S.  Johnston  [7]  have  shown  that  the  problem  of 
determining  the  chromatic  number  of  a graph  to  within  a factor  less 
than  2 is  NP-complete.  By  analogy  one  mi^t  have  expected  some  effect 
in  Theorem  5.5  at  a = 1/2  say,  but  none  is  apparent  (see  also 
Corollary  7.2  below). 


The  above  discussion  is  asyn^jtotic  in  nature,  but  we  may  be 

interested  in  applying  a ^kov  algorithm  to  graphs  which  are  fairly 

large  but  definitely  finite,  say  to  graphs  with  500  vertices.  Arguments 

similar  to  those  above  but  simpler  show  that  we  are  already  in  trouble. 

We  shall  see  below  that  for  more  than  5/^  of  the  graphs  on  500  vertices 

12 

every  prvined  Zykov  tree  has  more  than  10  nodes. 

Set  p = q s=  1/2  so  that  probabilities  correspond  to  proportions. 
We  shall  be  talking  about  graphs  in  • Note  first  that,  as  in  the 

proof  of  Lemma  5.2,  we  have 


Prob{x(G)  < 59}  < Prob{a(G)  > 14} 


< ( )2 


< 0.24 


(5.15) 


For  positive  integers  t and  m let  S(£,m)  be  the  set  of  graphs 
G in  which  have  a subgraph  on  m vertices  missing  at  most  I 

Dl  1 

edges.  Denote  ( ^ ) by  k and  suppose  that  £ < ^ k . Then 


Prob  S(£,m)  < ( )2"^  E (^) 

i=k-£ 


m 


< / 500  V -k/  k N k-s+1 
- ' m ''  ^ t ^ k-2s+l 


It  is  easy  to  check  using  the  above  that  for  exar^jle 

Prob  S(53»28)  < .01  . (5.16) 

Let  A be  the  set  of  graphs  G in  J^qq  such  that  ^(G)  > 59  and  G 
is  not  in  S(53;28)  . Then  by  (5.15)  and  (5.16) 

Prob  A > 0.75  . (5.17) 
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Now  let  G be  a graph  in  A and  let  T be  a pruned  Zykov  tree 
for  G . Then  in  T if  we  start  at  the  root  and  make  55  left  turns 
and  11  right  tvims  we  can  never  reach  a leaf;  for  if  H is  a leaf 
of  T then  H has  a complete  subgraph  on  59  vertices  and  at  least 
59  - 11  = 28  of  them  must  be  original  vertices  of  G . Hence  the 
number  of  leaves  of  T is  more  than 

2(  > 5 xlO^ 

1 g 

and  so  the  number  of  nodt's  in  T is  more  than  10  . Hence  by  (5.1?) 

for  more  than  5/U  of  the  graphs  G in  every  pnuied  Zykov  tree 

12 

for  G has  more  than  10  nodes. 

The  basic  result  in  this  section  is  of  course  Lemna  from  which 
Theorem  5*5  and.  Corollary  5.6  follow  immediately.  We  remarked  earlier 
that  Lemma  5.^  corresponds  to  Lemma  k.2  and  we  noted  in  Section  h that 
Lemma  h,2  is  in  a sense  best  possible.  We  now  investigate  how  good 
Lemma  is.  Proposition  5.7  below  shows  that  in  a (weaker)  sense 
Lemma  5.^  is  also  best  possible.  This  suggests  that  our  lower  bound 
for  the  size  of  a smallest  pruned  Zykov  tree  for  a graph  may  not  be  too 
bad.  However,  our  only  upper  bound  for  the  size  of  a smallest  pruned 
Zykov  tree  for  a graph  is  very  much  larger  (see  Corollary  6.2  in  the 
next  section). 

Proposition  5.7.  Let  a be  a positive  constant.  If  t and  r are 
functions  from  ^ to  ^ such  that 

log(^y^)  > (2 + o(l))  an(  ^log  n)^/^  (5.18) 

then 

Prob  T^(i,r)  -0  as  n - « , (5.19) 
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Further  if  the  Conjecture  5.5  holds  and  if 


H,., 


••5 

'* 
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log(^^^)  > (l  + o(l))an(^log  n)^/^  (5.20) 

then  again  (5.19)  holds. 

Proof.  For  each  n in  ^ let  5(n)  be  a real  number  such  that  say 
1 < 5>(n)  < 5 . Suppose  that  t and  r are  functions  from  W to  W 
such  that 

log(^^’^)  > (b +0(1))  an(  ilog  n)^/^  . (5.21) 

For  each  n in  let  d(n)  = (b/2)  n/log  n , let  be  the  set  of 

graphs  in  such  that  x(Gn)  < <i(’^)  ^ the 

set  of  graphs  in  such  that  in  every  Zykov  tree  for  whenever 

we  start  at  the  root  and  make  £(n)  left  turns  and  r(n)  ri^t  turns 
we  do  not  encounter  any  node  H with  tu(H)  > 0£  d(n)  . Then 

T^(f,r)  nC^  c D^(i,r)  . (5.22) 

We  shall  prove  that 

Prob  D(i,r)-*0  as  n-.®  . (5.25) 

\ 

■1 

Once  we  have  done  this  we  are  nearly  finished.  ; 

Note  first  that  we  may  assume  that  i(n)  < ( g ) and  r(n)  < n-1  . 

Also  if  < r for  some  n in  ^ then 

( '«)  < < 2^” 

' r — r ^ — 

I 

and  so  by  (5.21)  we  have  log(/+r)  = log  i + 0(l)  . 

Now 

( '7  ; 

and  so  by  (5.21)  again  1 

r(n)  > (c^ + 0(1))  n (log  n)"^/^  for  some  constant  c^  > 0 . (5.24) 

We  next  show  that  we  may  assume  that 
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r(n)  < (cg  + o(l))  n (log  for  some  constant  Cg  > 0 . (5.25) 

For  each  n in  ^ let  s(n)  = r (2  log  and  t(n)  = rad(n)1  . 

Then  by  Lemma  5«5 

ProbfG^  con^jlete]  -»  1 as  n -•  ® , (5.26) 

But  we  may  obtain  the  graph  from  the  graph  G^  by  performing  at 

most  (s(n)-l)t(n)  vertex-contractions,  and  so 

Dj^(0,  st)  c {G^  not  ccanplete}  . (5.27) 

Now  by  (5.26)  and  (5.27) 

Prob  0^(0  ,st)-»0  as  n-.«  . 

It  follows  that  we  may  assume  that  (5.25)  holds. 

We  now  show  that  for  n sufficiently  large  we  have 

l(n)  > X?  . (5.28) 

Let 

x(n)  = r(n)(log  n)^^^/n 

so  that  by  (5.2U)  and  (5.25)  we  have  log  x = 0(1)  . Note  that 
r/ad  = 2x/q?  (log  n)^/^ 

Now  if  (5.28)  is  false  then  for  infinitely  many  values  of  n we  have 

./  V ^ 2 (r/ad-1)^ 

i(n)  < n q'  ' ' 

and  so 

log(  ) = r(log  I - log  r + 0(1)) 

< X n(log  n)”^/^(2  log  n - r^/o^d^  + 2r/ad  - log  n 

+ i log  log  n + 0(1)) 

= (x  - l+x^/o(^6^  + 0(1))  n (log  n)^/^ 

< (6  + o(l))an(ilog  n)^/^ 
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(see  the  proof  of  Lemma  5.^).  But  this  contradicts  (5.21)  and  so  (5.28) 
must  hold. 

Now  for  each  n in  ^ let  s(n)  = Pr(n)/ad(n)  1 -1  and 
t(n)  = rad(n)n  . By  (5.25)  and  Lemma  3.5 

Prob{Q(G^)  full]  -•  1 as  n - ® . (5.29) 

Also 

(s(n)-l)t(n)  < (r/ad  -l)(ad  + l) 

< r 

2 2 

for  n sufficiently  large  that  ad  > r . Hence  as  in  the  derivation 
of  (5.27)  we  have  that  for  n sufficiently  large 

Djj(i, r)  c {G^  misses  more  than  f edges}  . (5*50) 

For  each  n in  M let  N be  a binomial  random  variable  with  parameters 
t 

( 2 ) and  q . Then  by  Lemma  3.3 

Prob{Gj^  misses  more  than  / edges] 

<l-Prob{N<  1}  Prob{Q(G^)  full]  . (5.31) 

But  /(n)  -•  00  as  n -*  « and  by  (5.28)  i(n)/E[N]  -•  » as  n -*  ® . 

Hence 

ProbfN  <i}-l  as  n-.®  . (5.52) 

But  now  (5.25)  follows  by  (5.29),  (5.30),  (5.31)  and  (5.32). 

Suppose  that  f^(n)  =2  + e(n)  for  n in  M ^ where  g(n)  > 0 
and  e(n)  0 as  n -»  ® sufficiently  slowly  that  by  Theorem  8 in  [8] 
we  have 

Prob  -•  1 as  n ® . (5.55) 
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proved  that  if  (5»l8)  is  true  then  so  is  (5.19)«  Now  suppose  that  the 
Conjecture  5«5  is  true  and  that  5(n)  = l+e(n)  for  n in  ^ , where 


c(n)  >0  and  e(n)-»0  as  n-»oo  sufficiently  slowly  that  (5.55) 

holds.  Hien  as  above  it  follows  that  if  the  Conjecture  5.5  (5.20) 

are  true  then  so  is  (5.19).  This  con5)letes  the  proof  of  Proposition  5.?.  □ 


1 1 


In  this  section  we  investigate  the  ’backtrack'  coloring  algorithm 
(BC  algorithm)  for  determining  the  chromatic  number  of  a graph.  This 
algorithm  was  pointed  out  to  the  author  by  R.  Tarjan.  Given  a graph  G 
it  explores  part  of  the  ’backtrack  coloring  tree'  (BC  tree)  for  G , 
which  is  an  implicit  envimeration  of  the  proper  partitions  of  G . We 
shall  see  that  the  BC  algorithm  is  essentially  the  same  as  a certain 
Zykov  algorithm,  the  'marked'  Zykov  algorithm.  Also  we  shall  give  an 
upper  bound  for  the  number  of  nodes  of  the  BC  tree  explored  by  the  BC 
algorithm.  It  will  follow  that  it  is  worth  prvuiing  BC  and  Zykov  trees. 

We  first  describe  the  backtrack  coloring  tree  (BC  tree)  for  a graph 

G in  . It  is  a rooted  tree  with  height  n-1  . Each  node  is 

colored  with  one  of  the  colors  c, ,...,c„  . A node  colored  c.  at 

i n 1 

depth  d (distance  d below  the  root)  corresponds  to  an  assignment  of 
color  c^  to  vertex  (d+1)  of  G . By  looking  at  a node  and  its 
ancestors  we  see  that  a node  at  depth  d corresponds  to  a coloring  of 
the  first  (d+1)  vertices  of  G . To  construct  the  BC  tree  for  G 


We  have  now  defined  the  BC  tree  for  G . It  is  not  hard  to  see 
that  there  is  a 1-1  correspondence  between  the  nodes  of  the  BC  tree 
for  G at  depth  d and  the  proper  peirtitions  of  the  subgraph  of  G 
induced  by  the  first  (d+1)  vertices  (see  Example  6,1  below).  Hence 
the  number  of  nodes  in  the  BC  tree  for  G is  between  C(G)  and  nC(G)  , 
and  so  Theorem  gives  asymptotic  results  about  the  size  of  BC  trees. 

If  k € N the  BC  tree  for  G pruned  at  k is  sin5)ly  the  root  of 
the  BC  tree  for  G if  k = 1 and  otherwise  it  is  the  mique  maximal 
rooted  subtree  of  the  BC  tree  for  G such  that  each  internal  node  is 
colored  with  one  of  the  first  (k-1)  colors.  The  prvmed  BC  tree  for  G 
is  the  BC  tree  for  G pruned  at  x(^)  • 

Example  6.1.  Take  G as  the  cycle  with  5 vertices,  nxambered  as 
indicated. 
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5 ^ 


In  (a)  below  we  show  the  part  of  the  BC  tree  for  G explored  by  the  BC 
aJ-gorithm.  In  (b)  we  show  the  same  tree  structure  and  indicate  at  each 
node  the  corresponding  partial  coloring  of  G . The  letters  a,  ...,J 
indicate  the  order  in  ^ich  the  nodes  are  first  visited  by  the  BC  algorithm, 


The  backtrack  coloring  eilgorithm  (BC  algorithm)  for  determining 
the  chromatic  number  x(®)  graph  G conducts  a depth-first  search 

of  the  BC  tree  for  G , keeping  to  the  right.  Cnee  we  have  foimd  a 
path  from  the  root  to  a leerf  using  at  most  the  first  k colors  we  know 
that  5 ^ 'f®  need  not  explore  the  descendants  of  any  node 

l^elled  with  a color  not  in  the  first  k-1  . Thus  we  'prune'  the  BC 
tree.  The  BC  edgorithm  must  of  course  explore  all  the  nodes  of  the 
pruned  BC  tree  for  G . 

In  order  to  relate  the  BC  algorithm  to  the  Zykov  branch- and-bound 
algorithms  considered  earlier  we  first  give  a description  of  an 
implementation  of  a ^kov  algorithm. 

Let  G be  a graph  in  for  some  n in  ^ . We  shall  define 

the  marked  Zykov  tree  (MZ  tree)  for  G . It  is  a certain  ^kov  tree 

for  G in  which  at  each  node  certain  vertices  are  'marked' . At  each 
node  H the  marked  vertices  form  an  initial  segment  of  the  entire 
sequence  of  vertices  — we  assume  that  the  sets  in  each  partition  of 
{1,  ...,n}  are  ordered  so  that  we  have  an  increasing  sequence  of  lesist 
integers  — and  the  marked  vertices  induce  a complete  subgraph  of  H . 
The  MZ  tree  of  G is  defined  as  follows.  The  root  is  of  course  G , 

and  we  mark  vertex  1 . Suppose  that  H is  a leaf  of  the  tree  so  far 

constructed.  If  the  first  \inmarked  vertex  in  H is  adjacent  to  each 
marked  vertex  then  mark  this  vertex.  Continue  doing  this  until  either 
every  vertex  of  H is  marked,  in  ^ich  case  H is  complete  and  is  a 
leaf  of  the  MZ  tree  of  G ; or  the  first  unmarked  vertex  is  not  adjacent 
in  H to  some  marked  vertex.  In  this  case  we  branch  on  the  first 
unmarked  vertex  and  the  first  marked  vertex  not  adjacent  to  it.  Marked 


i 

I 


j 
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vertices  stay  marked  in  the  sons  of  H and  the  new  contracted  vertex 
in  the  rightson  is  also  marked. 

The  marked  Zykov  algorithm  (MZ  algorithm)  explores  part  of  the  MZ 
tree  using  depth-first  search  keeping  ri^t,  and  prunes  the  tree  using 
the  fact  that  the  marked  vertices  at  a node  point  out  a con^dete  subgraph. 
The  MZ  algorithm  must  of  course  e3q>lore  every  node  in  the  pruned  Zykov 
tree  corresponding  to  the  MZ  tree.  It  is  quite  similar  to  the  algorithm 
in  [5]. 


Exan^le  6.2.  As  in  Exan^le  6.1  take  G as  the  cycle  with  5 vertices, 
numbered  as  indicated. 


Then  the  part  of  the  MZ  tree  for  G e^qjlored  by  the  MZ  figorithm  is  shown 
below.  The  marked  vertices  are  filled  in,  and  in  addition  we  have  labelled 
the  first  marked  vertex  with  , the  second  with  c^  and  the  third 
with  a,  . 
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It  should  be  apparent  that  the  BC  and  MZ  algorithms  are  really 
different  forms  of  the  same  algorithm.  Suppose  that  G is  a graph 


in  . Then  it  is  not  hard  to  prove  that  there  is  a correspondence 
between  the  nodes  of  the  BC  tree  B for  G and  the  nodes  of  the  MZ 
tree  Z for  G such  that 

(a)  each  node  in  B corresponds  to  one  or  two  nodes  in  Z ; 

(b)  each  node  in  Z corresponds  to  between  1 and  n nodes  in  B ; 

(c)  pruning  occurs  at  corresponding  nodes. 

The  lettering  in  Examples  6.1  and  6.2  indicates  such  a correspondence. 
Let  B and  Z be  the  parts  of  the  trees  B and  Z explored  by  the 
BC  and  MZ  algorithms  respectively.  Then  by  the  above 

2lB*|  > jz*l  and  nlz*|  > |B*| 

It  follows  by  Corollary  5.6  that  for  almost  all  graphs  on  n vertices 

1/2 

the  BC  algorithm  requires  time  at  least  for  some  constant 

c > 1 . The  next  restilt  yields  an  upper  bound  for  the  time  required  by 
the  BC  or  MZ  algorithm. 

Theorem  6.1.  Let  e > 0 . Then  for  almost  all  graphs  in  the 

number  of  nodes  of  the  BC  tree  explored  by  the  BC  algorithm  is  at  most 
(|  + e )n 

n . If  Conjecture  5.5  is  true  then  for  almost  adl  graphs  G 

(^e)n 

in  Jr  the  pruned  BC  tree  for  G has  at  most  n nodes, 

n n 

Proof.  Let  k be  a function  from  N to  . For  each  graph  G 
in  let  B (G)  be  the  BC  tree  for  G pruned  at  k . For  i,J 

in  let  f(i,  J)  be  the  expected  nxanber  of  proper  partitions  into  J 

sets  of  graphs  in  J^  . Then 


f 

! 


E[jB^G  )]]  < E E f(i,J)  . 
i=l  j=l 

From  the  proof  of  Theorem  3«5  we  have 
./.  .X  . .i  2^^ 

< J 9 

and  so  if  i < n and  j < k we  certainly  have 
Now  let 

lt(n)  = L(l+e)n/log  nj 
for  n in  W . Then  for  i € N , i < n 

n q ' ^ n q ' 

and  so  by  (6.1)  and  (6.2) 

EI|bNc^)|]  < n"*®  q'^“ 

( I + +o(l))n 


Hence 


Prob{lB^G^)|  < n 


( I + e)n 


) -♦  1 as  n -♦ 


Now  the  BC  algorithm  initially  explores  the  'rightmost'  path  in  the  BC 
tree,  and  so  initially  it  acts  like  the  sequential  coloring  algorithm. 
Hence  by  Theorem  8 in  [8],  for  almost  all  graphs  in  Ji^  the  BC  algorithm 
explores  at  most  n nodes  of  the  BC  tree  which  are  not  in  the  BC  tree 
pruned  at  k . The  first  part  of  Theorem  6.1  now  follows  from  (6.3). 

We  now  prove  the  second  part  of  the  theorem.  Let 
k(n)  « L(l+c)  I n/log  nJ 
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for  n in  ^ , Then  for  i £ ^ ^ ^ 

i ^/2)a 
n q ' < n 

and  so  by  (6.1)  and  (6.2) 

j,  , (i(l+e)+o(l))n 

E[1bNg^)1]  < n 

Hence  as  above 

k ( 

Prob{|B^(G^)|  < n ^ } -1  as  n ^ ® . (6.4) 

Denote  the  pruned  BC  tree  for  a graph  G by  B*(G)  . If  x(G)  < k 
then  1b*(G)1  < |B^(G)1  . Thus 

* ( 17'*' V ( ^‘*"e)j^ 

{lB(G^)l<n^  } 5 {IB  (G„)l  <n  }n{x(G^)<k]  . (6.5) 

Now  suppose  that  Conjecture  5.5  holds,  so  that 

Prob[x(G^)  < k}  1 as  n ® . (6.6) 

Then  the  second  part  of  Theorem  6.1  follows  from  (6.4),  (6.5)  and  (6.6).  □ 


Corollary  6.2.  Let  e > 0 . Then  for  almost  all  graphs  in 

the  number  of  nodes  of  the  marked  Zykov  tree  for  G_  explored  by  the 

marked  Z^'kov  algorithm  is  at  most  n . if  Conjecture  5.3  holds 

then  for  almost  all  graphs  in  the  pruned  marked  Zykov  tree 

( J'*'e)j^ 

for  has  at  most  n nodes. 


7.  Lengths  of  Proofs. 


Most  of  our  results  so  far  may  be  phrased  in  terms  of  the  lengths 
of  certain  kinds  of  proof  which  determine  chromatic  numbers  or  viiich 
establish  lower  bounds  for  chromatic  numbers.  We  then  obtain  results 
concerning  chromatic  numbers  which  are  similar  in  spirit  to  recent  results 
of  V.  Chvatai  [!»■]  concerning  stability  numbers.  Indeed  this  paper  was 
initially  motivated  by  discussions  with  Chvatai  concerning  his  results. 

If  k is  an  integer  at  least  as  great  as  x(^)  then  there  is  a 
short  proof  that  x(^)  5 ^ — namely  we  may  exhibit  a coloring  of  G 
using  at  most  k colors.  In  general  such  a proof  is  heird  to  find  but 
it  must  of  course  exist.  However;  if  k is  at  most  x(G)  then  it 
is  not  clear  if  there  is  necessarily  a short  proof  of  this  fact. 

The  following  two  rules  may  be  used  to  determine  or  boimd  chromatic 
numbers  (see  Section  2 and  (2.1)  in  particular). 

(Rl)  x(G)  = ®in{x(Gy  ,x(GV}  • 

(R2)  If  G is  complete  then  x(G)  equals  the  number  of  vertices  of  G . 

Given  a set  S of  rules  like  (Rl)  and  (R2)  let  us  call  a proof  that  uses 
only  these  rules  an  S-proof,  and  each  application  of  a rule  in  S a step. 
Clearly  there  is  a close  correspondence  between  an  {(Rl), (R2) }-proof 
determining  x(G)  and  a Zykov  tree  for  G . 

From  Theorem  U.l  we  obtain 

Corollary  7.1.  If  G is  a graph  in  then  every  { (Rl),  (R2) }-proof 
which  determines  x(G)  without  redundant  steps  has  exactly  2C(G)-1  steps. 

Thus  by  Theorem  U.5  we  know  quite  a lot  about  the  lengths  of 
{(Rl), (R2) }-proofs  which  determine  chromatic  numbers.  Consider  now  a 
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third  rule,  vAiich  can  be  used  to  establish  a lower  bound  for  chromatic 
ntnnbers  (see  (2.2)). 

(I^)  x(G)>u>(G)  . 

Allowing  the  use  also  of  the  rule  (R5)  corresponds  to  pruning  our  Zykov 
trees.  From  Theorem  5«5  we  obtain 


i 

* 


Corollary  7.2»  If*  ct  is  a given  constant  factor  with  0 < a < 1 then 
for  almost  all  graphs  in  every  {(Rl),  (R5)}-proof  which  establishes 

a lower  bound  for  exact  to  within  the  factor  a is  such  that  the 

logarithm  of  the  number  of  steps  is  asynptotically  at  least 

a n(  ^log  n)^/^  . 

Now  set  p = q = 1/2  and  a = 1 in  Corollary  7*2  (as  we  did  in 
Theorem  5.5). 

Corollary  7.5.  Consider  the  property  for  graphs  in  that  in 

every  {(Rl,  (R3)}-proof  establishing  the  correct  lower  bound  for  x(Gjj^ 

the  number  of  steps  is  at  least 

n(log_  n)^/^ 

(l.lU)  ^ 

The  proportion  of  graphs  in  with  this  property  tends  to  1 as  n - ® . 
From  Corollary  6.2  we  obtain 


Corollary  7.J|.  Let  e > 0 . Then  for  almost  all  graphs  G in  ^ 


n 


the  marked  ^kov  algorithm  yields  and  {(Rl),  (R2),  (RJ  )]-proof  determining 

x(G^)  with  at  most  n steps.  If  Conjecture  5.3  holds  then  for 

almost  all  graphs  in  the  marked  ^kov  algorithm  (eventually) 
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yields  an  {(Rl),  (E2),  (R5 )]-proof  determining  at  most 

(J-*e)n 

n steps. 

Consider  now  a fourth  rule  which  can  be  used  to  bound  chromatic 
numbers. 

(rU)  If  G has  a subgraph  H then  x(G)  > x(H)  • 

The  set  of  rules  {(Rl),  (R2),  (Rlt) } seems  to  the  author  to  be  as  natural 
as  the  set  {(Rl),  (R3)}  for  establishing  lower  bovinds  for  chromatic  numbers. 
The  following  proposition  shows  that  the  two  sets  of  niles  are  in  a sense 
equivalent.  The  proof  is  strai^tforward  and  is  omitted. 

Proposition  7.3.  For  any  {(Rl),  (l^)} -proof  that  x(^)  > ^ there  is  an 
[(R1), (R2), (RU)}-proof  with  at  most  twice  as  many  steps;  and  for  any 
{(R1),  (R2),  (Rl*^)}-proof  that  x(G)  >k  there  is  an  { (Rl),  (R5  ) }-proof 
with  no  more  steps. 

At  first  sight  it  might  seem  to  be  of  advantage  to  allow  also  rules 
like  the  rule  (r5)  below,  which  is  closely  related  to  the  rule  (Rl). 

(R5)  x(G)  > max{x(G^)  , x(G^  ]-l  • 

cue  would  of  course  not  have  to  know  both  x(^^)  order 

to  use  the  rule  (R5).  However,  it  is  not  hard  to  prove  for  exan5>le  the 
following  proposition. 

Proposition  7.6.  For  any  [(Rl), . ..,  (R5)}-proof  that  x(G)  > k there 
is  an  {(Rl),  (R5)]-proof  with  no  more  steps. 

Another  rule  idiich  might  be  considered  is  the  following. 
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(r6)  If  some  vertex  v in  G is  adjacent  to  each  other  vertex  then 
x(G)  = x(G-'v)+1  (where  G-v  has  the  obvious  meaning). 


However,  again  we  may  see  without  difficulty  that  including  this  rule 
would  not  lead  to  shorter  proofs. 

Yet  another  possible  rule  which  mi^t  be  thou^t  helpful  is  the 
'principle  of  separation  into  pieces',  as  described  in  [2]  Chapter  15. 
This  rule  shows  how  to  break  our  problem  into  smaller  independent 
subproblems  if  the  graph  has  a separating  set  which  induces  a ccngilete 
subgraph.  It  may  on  occasion  help  to  organize  proofs  but  once  again  we 
may  easily  check  that  it  does  not  shorten  them. 

Finally  let  us  note  that  all  the  above  discussion  falls  down  if  we 
are  allowed  to  recognize  isomorphic  graphs  with  different  vertex  sets. 
It  would  be  interesting  to  know  what  can  be  said  in  this  case. 


8.  Minimal  Colorings. 

Many  authors  have  investigated  algorithms  A for  (properly)  coloring 
grains  G which  are  fairly  fast  but  which  use  a number  A(g)  of  colors 
possibly  greater  than  ^(g)  . (See  for  example  [9],  [12],  [I5],  [14],) 

A, 

Following  D.  S.  Johnson  [9]  we  let  a(G)  be  the  ratio  of  A(G)  to  )^(G)  , 

^ A 

and  let  A(n)  be  the  maximum  value  of  A(G)  over  graphs  G on  n 

A A A 

vertices.  Clearly  1 < A(n)  < n and  the  smaller  A(G)  or  A(n)  is  the 
better.  In  [9]  it  is  shown  that  for  several  of  the  most  conmon  algorithms 

A 

A the  function  A(n)  is  of  order  n . For  the  best  of  the  known  (fast) 

A 

algorithms  the  function  A(n)  is  still  of  order  n/log  n , 

It  is  suggested  in  [9]  that  the  usual  behavior  of  A(g^)  for  graphs 

A 

on  n vertices  may  be  very  different  from  the  behavior  found  for  A(n) 
We  shall  see  that  this  is  indeed  the  case. 

Consider  first  the  sequential  coloring  algorithm  SA  or  A^^  (see 
[8],  [9]  and  Section  3 of  this  paper).  Johnson  shows  without  difficrilty 
that  AQ^(n)  is  of  order  n , and  suggests  that,  however,  the  eacpected 
value  of  A2^(t^)  ”ia,y  be  bounded  by  a constant  independent  of  n . It 
follows  easily  from  results  in  a paper  [8]  by  G.  Grinmett  and  the  present 
author  that  for  any  e > 0 we  have  5 2+e  for  almost  all  graphs 

Gj^  in  ; also  it  is  easy  to  prove  that  the  expected  value  of  ^(G^^) 
is  at  most  2+e  for  n sufficiently  large  (see  the  proof  of  Theorem  8.2 
below) . 

A 

We  now  look  at  the  usual  behavior  of  A(G^)  for  other  coloring 
algorithms  A . A proper  coloring  of  a graph  G is  minimal  if  for  each 
pair  of  colors  used  some  vertex  of  one  color  is  adjacent  to  some  vertex 
of  the  other  color;  that  is,  if  no  color  can  be  replaced  by  some  other 
already  used  color;  that  is,  if  the  corresponding  proper  partition  Q 


of  G is  such  that  the  contracted  graph  G^  is  ccn^lete.  A coloring 
algorithm  is  minimal  if  it  always  yields  minimal  colorings.  All  the  usual 
coloring  algorithms  are  minimal,  and  in  any  case  from  an  arbitrary  proper 
coloring  one  may  easily  produce  a minimal  coloring.  Thus  it  seems 
reasonable  to  restrict  our  attention  to  minimal  coloring  eilgorithms. 

For  every  graph  G let  M(G)  be  the  maximum  value  of  A(g)  over 
all  minimal  coloring  algorithms  A . An  alternative  definition  of  M(g) 
is  then  that  it  is  the  largest  integer  t for  idiich  there  exists  a proper 
partition  Q of  G into  t sets  such  that  the  contracted  graph  G„ 
is  conQjlete.  For  every  graph  G we  let  M(g)  be  the  ratio  of  M(G) 
to  x(^)  • Th'is  M(G)  is  a measure  of  how  badly  it  is  possible  to 
color  G . 

It  seems  that  for  any  fast  coloring  algorithm  A yet  proposed  there 
exist  graphs  on  which  A performs  very  badly  ([9])»  However,  for  most 
graphs  every  minimal  coloring  algorithm  performs  not  too  badly:  we  shall 

prove  below  that  M(G^)  is  in  probability  only  of  order  (log  n)^/^  . 

Lemma  8.1.  Let  e > 0 . Then  for  eilmost  gi  1 graphs  G^  in 

(1-e)  n (2  log  n)“^/^  < M(G^)  < (1+e) n (log  n)'^/^  . (8.1) 

Further  for  n sufficiently  large  the  expected  value  of  M(G^)  lies 
in  the  above  range. 

Proof.  The  left  hand  inequality  in  (8.1)  follows  immediately  from  Lenina  5.6. 

Let  m be  an  integer  at  least  (1+e) n (log  n)~^^^  . By  Lemma  5.2 
the  probability  that  a given  partition  Q of  {1, ...,n}  into  m sets 
yields  a conqplete  graph  G^  is  at  most 
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Hence  the  probability  that  there  exists  such  a partition  Q 
(proper  or  not)  is  at  most 

n,.  (n/m)^^2^ 

n (1  - q'  / ' ) 

But  now 

log  < n log  n - ( 2 ) log  e q(V“)  < 

if  n is  sufficiently  large.  Hence  for  n sufficiently  large 

Prob{M(G^)  > (l+e) n (log  n)'^/^}  < n q“  . (8.2) 

The  rig^t  hand  inequality  in  (8.1)  follows  from  (8.2),  and  so  we  have 
ccnq)leted  the  proof  of  (8.1). 

The  second  part  of  the  lemma,  concerning  expected  values,  follows 
frcmi  the  left  hand  inequality  in  (8.1)  and  from  (8.2).  □ 

Recall  that  M(G)  is  the  ratio  of  M(G)  to  x(G)  • 

Theorem  8.2.  Let  e > 0 . Then  for  almost  «n  graphs  G^  in 

- e)(log  n)^/^  < M(G^)  < (2+e)(log  n)^/^  . (8.3) 

Further  for  n sufficiently  large  the  expected  value  of  M(G^)  lies  in 
the  above  range. 

Proof.  We  loiow  from  [8]  (see  also  [6]  Chapter  11)  that  for  almost  all 
graphs  G^  in 

1/2  n/log  n < x(C^)  < “ • (8.4) 

Now  (8.3)  follows  from  (8.4)  and  Lemna  8.1. 

The  left  hand  inequality  for  the  expected  value  of  M(O^)  follows 
from  the  left  hand  inequality  in  (8.3).  For  the  right  hand  inequality 
note  first  that 
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(8.5) 


E[M(G^)]  < ( 1/2  n/log  n)“^  E[M(Gj^)] 

+ n Prob{x(G^)  < 1/2  n/log  n} 

But  from  the  proof  of  Lemna  5.2 

n Prob{x(G^)  < 1/2  n/log  n}  -•  0 as  n -•  ob  , (8.6) 

and  by  Lemma  8.1  for  n sufficiently  large 

E[M(Gj^)]  < (1  + e/5)n(log  n)“V2  . (8.7) 

Hence  by  (8.5),  (8.6)  and  (8.7) 

E[M(G^)]  < (2+e)(log  n)^/^ 

for  n sufficiently  large.  This  con5)letes  the  proof  of  this  the  final 
theorem.  □ 
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Some  Questions. 

The  main  result  has  been  that  2^kov  algorithms  for  determining  the 
chromatic  nvimber  of  a graph  in  probability  take  time  at  least 


,n(log  n)"^ 


(for  seme  constant  c > 1 ) 


on  graphs  on  n vertices.  This  result  raises  at  least  three  questions 
that  merit  attention. 

Firstly,  the  best  upper  bound  here  for  the  time  taken  is  very  much 
greater  than  the  lower  bound.  Is  the  lower  bound  of  the  ri^t  order  of 
magnitude? 

Secondly,  all  the  resvilts  here  are  based  on  the  random  graph  model 
which  has  constant  edge-probability  p , and  in  certain  circumstances 
the  model  which  has  constant  average  degree  say  might  be  more  appropriate 
(see  for  example  [6]  Chapter  16).  Are  there  corresponding  results  for 
this  case? 

Thirdly,  it  follows  from  the  discussion  in  Section  7 that  various 
' inQ)rovements ' in  the  Zykov  algorithms  do  not  in  fact  lead  to  a decrease 
in  the  time  taken.  But  what  happens  if  say  we  aOlow  an  isomorphism 
search? 
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